Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm
نویسندگان
چکیده
In linear mixed models, the assumption of normally distributed random effects is often inappropriate and unnecessarily restrictive. The proposed approximate Dirichlet process mixture assumes a hierarchical Gaussian mixture that is based on the truncated version of the stick breaking presentation of the Dirichlet process. In addition to the weakening of distributional assumptions, the specification allows to identify clusters of observations with a similar random effects structure. An ExpectationMaximization algorithm is given that solves the estimation problem and that, in certain respects, may exhibit advantages over Markov chain Monte Carlo approaches when modelling with Dirichlet processes. The method is evaluated in a simulation study and applied to the dynamics of unemployment in Germany as well as lung function growth data.
منابع مشابه
Clustering in linear mixed models with Dirichlet process mixtures using EM algorithm
SUMMARY: In linear mixed models the assumption of normally distributed random effects is often inappropriate and unnecessary restrictive. The proposed Dirichlet process mixture assumes a hierarchical Gaussian mixture. In addition to the weakening of distributions assumptions the specification allows to estimate clusters of observations with a similar random effects structure identified. An Expe...
متن کاملClustering in Additive Mixed Models with Approximate Dirichlet Process Mixtures using the EM Algorithm
SUMMARY: We consider additive mixed models for longitudinal data with a nonlinear time trend. As random effects distribution an approximate Dirichlet process mixture is proposed that is based on the truncated version of the stick breaking presentation of the Dirichlet process and provides a Gaussian mixture with a data driven choice of the number of mixture components. The main advantage of the...
متن کاملLearning Task Relatedness via Dirichlet Process Priors for Linear Regression Models
In this paper we present a hierarchical model of linear regression functions in the context of multi–task learning. The parameters of the linear model are coupled by a Dirichlet Process (DP) prior, which implies a clustering of related functions for different tasks. To make approximate Bayesian inference under this model we apply the Bayesian Hierarchical Clustering (BHC) algorithm. The experim...
متن کاملMixture of linear mixed models for clustering gene expression profiles from repeated microarray experiments
Data variability can be important in microarray data analysis. Thus, when clustering gene expression profiles, it could be judicious to make use of repeated data. In this paper, the problem of analysing repeated data in the model-based cluster analysis context is considered. Linear mixed models are chosen to take into account data variability and mixture of these models are considered. This lea...
متن کاملSmall-Variance Asymptotics for Exponential Family Dirichlet Process Mixture Models
Sampling and variational inference techniques are two standard methods for inference in probabilistic models, but for many problems, neither approach scales effectively to large-scale data. An alternative is to relax the probabilistic model into a non-probabilistic formulation which has a scalable associated algorithm. This can often be fulfilled by performing small-variance asymptotics, i.e., ...
متن کامل